# Swin-BART architecture
OCR DocVQA Donut
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for document visual question answering tasks.
Image-to-Text
Transformers

O
jinhybr
240
13
OCR Donut CORD
MIT
Donut is an OCR-free document understanding model based on Swin Transformer visual encoder and BART text decoder, this version is fine-tuned on CORD receipt dataset
Image-to-Text
Transformers

O
jinhybr
1,130
206
Featured Recommended AI Models